Preconditioning via Diagonal Scaling
نویسندگان
چکیده
Interior point methods solve small to medium sized problems to high accuracy in a reasonable amount of time. However, for larger problems as well as stochastic problems, one needs to use first-order methods such as stochastic gradient descent (SGD), the alternating direction method of multipliers (ADMM), and conjugate gradient (CG) in order to attain a modest accuracy in a reasonable number of iterations. The condition number of a matrix A, denoted by κ(A), is defined as the ratio of its maximum singular value to its minimum singular value. Both theoretical analysis and practical evidence suggest that the precision and convergence rate of the first-order methods can depend significantly on the condition number of the matrices involved in the problems. As an example, the CG algorithm for solving the linear system Ax = b achieves a faster rate of convergence when the condition number of A is smaller. Hence, it is desirable to decrease the condition number of matrix A by applying a transformation to it; this process is called preconditioning. A special case of preconditioning is called diagonal scaling. Here, we are interested in finding diagonal matrices D and E to minimize the condition number of the matrix A = DAE, in order to accelerate first-order methods. For example, in applying CG to solve Ax = b, we can solve Ax̃ = Db instead, and recover x = Ex̃, while taking advantage of the small condition number of A. In our numerical experiments and from theoretical analysis, we have concluded that preconditioning can improve the performance of the first-order methods in two different ways. First, it can significantly accelerate the linear algebra operations. For example in [OCPB13], each step of the algorithm involves running CG which can be done remarkably faster if the appropriate preconditioning is applied. The second effect of the preconditioning, which should be distinguished from the first one, is decreasing the number of iterations for achieving desired accuracy by following different intermediate points in ADMM. In this report, we first discuss heuristics for diagonal scaling. Next, we motivate preconditioning by an example, and then we study preconditioning for a specific splitting form in ADMM called graph projection splitting. Finally we examine the performance of our methods by some numerical examples.
منابع مشابه
Dynamic scaling based preconditioning for truncated Newton methods in large scale unconstrained optimization
This paper deals with the preconditioning of truncated Newton methods for the solution of large scale nonlinear unconstrained optimization problems. We focus on preconditioners which can be naturally embedded in the framework of truncated Newton methods, i.e. which can be built without storing the Hessian matrix of the function to be minimized, but only based upon information on the Hessian obt...
متن کاملOn Algorithms For Permuting Large Entries to the Diagonal of a Sparse Matrix
We consider bipartite matching algorithms for computing permutations of a sparse matrix so that the diagonal of the permuted matrix has entries of large absolute value. We discuss various strategies for this and consider their implementation as computer codes. We also consider scaling techniques to further increase the relative values of the diagonal entries. Numerical experiments show the eeec...
متن کاملRow Scaling as a Preconditioner for Certain Nonsymmetric Linear Systems with Discontinuous Coefficients
Linear systems with large differences between coefficients, called “discontinuous coefficients”, arise in many cases in which partial differential equations (PDEs) model physical phenomena involving heterogeneous media. The standard approach to solving such problems is to use domain decomposition (DD) techniques, with domain boundaries conforming to the boundaries between the different media. T...
متن کاملEffective and Robust Preconditioning of General SPD Matrices via Structured Incomplete Factorization
For general symmetric positive definite (SPD) matrices, we present a framework for designing effective and robust black-box preconditioners via structured incomplete factorization. In a scaling-and-compression strategy, off-diagonal blocks are first scaled on both sides (by the inverses of the factors of the corresponding diagonal blocks) and then compressed into low-rank approximations. ULV-ty...
متن کاملGeometric scaling: a simple preconditioner for certain linear systems with discontinuous coefficients
Linear systems with large differences between coefficients (“discontinuous coefficients”) arise in many cases in which partial differential equations (PDEs) model physical phenomena involving heterogeneous media. The standard approach to solving such problems is to use domain decomposition (DD) techniques, with domain boundaries conforming to the boundaries between the different media. This app...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014